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ABSTRACT 

This document reviews some of the indices of language 
development which have been constructed since the Anderson study of 
1937. In addition to the findings of Anderson, the T-unit proposed by 
Hunt (1965), the measure of T-unit length and mean number of 
sentence-combining transformations per T-unit developed by O'Donnell, 
Griffin, and Norris (1967), the instrument for collecting comparable 
samples of writing from children at var. us grade l**vels developed by 
O'Donnell (1968), the syntactic complexity scale developed by 
Endicott (1973), and the Syntactic Density Score developed by Golub 
and Kidder (1974) are discussed. (HOD) 
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The quest: for an objeocive ar.d reliable index of children's language 
dovolo;nn«3nt was hegun mwy years ago. In a study reported nearly forty 
years w^.go, Anaerscra (i9i7) ovai'aated by stacisticai rrtethods three indices " 
01 iangud^o uoveiw'pment . anaiyzei data on sentence length and pronoun 
index, but was primarily interested in the subordination index that had been 
dcvi::od a short time earlier by LaL^rant (1933 )• 

Anderson was concerned with the generality of LaBrant's index and in 
the possibility of developini^ an easily applied and uniform measuring device • 
Because of such varLibles as composition length, subject. matter , and 
situations i;i which lan^^uage is used, he drew negative conclusions about 
generalized application of the subordination index* He recognized, however, 
the existence of . . .a fertile field for the development of indices based 
on common subject matter, well-cate£?ori2ed scales, and adequate samples from 
the standpoint of length of passage/' 

Despito AnderLjon^s :'^ir4i:ig?> , subsequent investigations of children's 
lam-uagG relied heavily on r.ean sentence length and the subordination index j 
and, durin;^ the following, twenty-five years, not .nuch progress was made 
toward an easily applied and uniform index of language development* Techniques 
of lliiguliirir atiaJvsis devt^u^p^d in the early 1960*s however^ offered new 
hope that such an index could be devised. 

In a study of children's grammatical structures written at three grade 
levels, Hunt (1965) gave a critical evaluation of the traditional indices 
of langua;^e development and proposed some new indices* Pointing out the 
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fact that inadequate punctuation and indiscriminate use of and makes 
;^ontonce ienr,th <in unreliable index, Hun^ proposed a syntactic unit consisting 
of one main clause and any subordinate ola-j.^as attached to the main clause* 
Such a unit would be gramatically capable of being considered a sentence; 
and sinct^ it wouiu i^e a terminable unit, he proposed that it be called a 

T-unit , This unit could be identified objectively, and it would not be 

I 

affected by poor punctuation • It would have the added advantage of preserving 
all the subordination achieved by the student and all of his coordination of 
words, phrases, and subordinate clauses • It would not preserve the student *s 
coordiaation of main clauses; but, as Hunt pointed out, excessive coordination 
of main clauses might be a sii^n of immaturity rather than maturity • Thus, 
the elimination of coordinated main clauses from the syntactic unit to 
be used in language development studies would be a gain rather than a loss* 

Hunt called attention to the fact that LaBrant counted coordinated 
verbs as separate clauses; he contended that only a structure with a subject 
and a finite verb should be re,fnirded as a clause* Then, having identified 
a syntactic unit more useful than the sentence and having limited the 
definition of the .:iaase, Hunt proposed a revision of the subordination 
index* He proposed that, instead of dividing the number of subordinate 
clauses by the total number of clauses as LaBrant had done, the total number 
oJ clauses snouid be ^Jivided ty the number of main clauses* This procedure 
wcmld f>i-Vv} th^ inoan /.urriber oi clauses per T-unit, which could be converted 
into the ratio of suborUr.ate clauses to main clauses* 

Hunt computed tbte-^^b proposed indices for the materials analyzed in 
his study and found str t Lwticaily significant increases in T-unit length, 
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clause length, and number of clauses per T-unit from grade four to grade 
eight to grade twelve • He pointed out that these three measures are 
interrelated. T-units can be lengthened by either or both of two means J 
(1) lengthening clauses or (2-) increasing the number of subordinate clauses* 
His analysis, basev. o:\ techni :ues of transformational-generative grammar > 
revealed that, syntactic cor.plexity, of children's language is reflected in 
both clause LMhUli anu nuroer of clauses per T-unit, Hunt demonstrated that 
T-unit length could be computed by multiplying words per clause by number 
of clauses per T-unit. Since T-unit length incorporated the other two' 
indices, it seemed evident that T-unit length would be a useful index of 
structural complexity of language, 

O'Donnoll, Oriffln, and Korris (1967), in their study of children's 
syntax, computed botn mean T-unit length and mean number of sentence- 
combining transformations per T-unit. Their rationale for the latter measure 
is stated in part as follows: ; 

There are transformational rules which designate operations 
affecting two underlying strings so as to join them or embed 
one in the other. The process is often called generalized 
transformation, but it is also referred to as sentence-combining 
transformation, because its effect is to produce one sentence 
where otherwise there v/ould have been two (p.lG)* 

Except for coordination of main clauses, sentence-combining 
• transformations may be conceived as embedding one kernel sentence 
(often, though not always, in reduced form) into another in ways 
determined by the rules of grammar* This embedding increases 
the information carrying pov/er of the resulting construction. 
It may well be supposed, then, that at least for children, the 
relative density of these transformations within T-units signalizes 
the degree of maturity attained (p. 50). 



BEST COPY AVAIIABLE 

These investigators fcund a positive correlation between increases 
in n\.v'.\i^-3v oi wurU-i v^v i'-unit and nutnbsr of sentence-combining transformations ^ 
per T-unit. Since the two measures appeared to be of comparable value 
as indices of language development, they indicated a preference for mean 
T-unit length because? it was easier to cqijipute. They concluded that "mean 
length of T-units has special claim to consideration as a simple* objective, 
valid indicator or d>.^v>5iopr.iant in syntactic control" (pp .•98-99). 

In order to see v/hether the indices Hunt proposed could be computed 
from smaller writing samples, C'Donnell (1968) devised an experimental 
instrument for i2oli»2ctinr, co!::parable samples of writing from children at 
various grade levels. This instrument required the rewriting of a passage 
composed of simple declarative sentences. Subjects were asked to rewrite the 
passage, putting the short sentences together to make longer sentences of 
varying degrees of structural -»2omplexity. 

Data resulting from use of the sentence-combining instrument indicated 
that both clause length and number of clauses per T-unit increased together 
at the lower grade levels, but that clause length alone accounted for most 
of the growth in complexity at higher grade levels. Since T-unit length 
is accounted for by a combination of the two factors., T-unit length was 
ludftod to be mora .usof-ui cnan either of the otner two measures alone as an 
index of growth in structural complexity over a wide age-range. 

In commenting on the results of his study, O^Donnell pointed out both 

the potential value and the limitations of these indices of language development} 

Although it is i?vident that clause length, T-unit length and 
number of clausies per T-unit increase with advance in 'grade, 
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... there are no data to show hov; consistently th^se indexes 
measure the structural complexity of an individual student's 
writing in various situations (p. 6). 

But in spite of their limitations, these indices (particularly T-unit length) " 
were subsequently used in numerous studies of children's language development. 

Another limitation, "obvious to anyone who has attempted to analyze 
the structure of children's language, lies in the fact that indices based 
on moan length of syntactic units do not discriminate among the various 
ways length can be achieved. The T-unit is more satisfactory in this 
respect than the sentence, but the T-unit can be lengthened in a variety 
of ways, some of wnicn require a great jieal more linguistic maturity than 
c:hers. The desire to find a more discriminating index of language 
development has motivated researchers to attempt to develop other indices 
of structural growtn. 

One index that appears to have potential capacity to reflect differences 
in degree of complexity is that proposed by Endlcott (1973). He accepts 
the T-unit as a consistent and useful research tool and offers a psycho- ^ 
linguistic definition of the T-unit as: "the extent to which a child 
combines units of complexity in language involving suspension of thought 
and mental manipulation of syntactic structures before he breaks off and 
begins again" (p. 6). 

"Endicott advances a theoretical model Intended to allow the defining 
of units of syntactic complexity in psycholinguistlc terms. This involves 
the development of a rationale for defining such units, which he hopes will 
result in a scale of syntactic complexity that will be useable by researeheM 
in language development, psycholingulstlcs, and readlbllity. Endlcott 's 
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tneoreticai mooe^u i^s^z on t ansforr.ational and morphemic analyses, 
sinco \w co\\<u\s:V3 nui; v/'.lation of transformations and the suspension 
or production of morphemes as pnases of mental operations • 

Ti.o uriiti.; iJi.aLcoLt. proposes lu use in constructing the syntactic 
^ scale are cJefined i^eicw* 

Co^meme : A unit of complexity in language consisting-of four sub- 
categories: The base co-rr.eme, the syntactic co-meme, the compression 
co-meme, and the morphemic co-meme. 

Bd:^e Co-inome ; Tnose uorpliemes expressed at a level of language which 
has a one morpheme per-word 'ratio. 

Syntactic CO"i;ieme; A theoretical syntactic operation by which sentences 
are combined or altered to achieve efficiency or variation of purpose 
beyond that achieved at a minimal level of language. 



Compression Co-meme ; The theoretical morphemic burden of deep structure 
v/hich is compressed into surface structure through combination or 
deletion transformations. 

Morphemic Co-meme : Morphemes other than those expressed by base 
co-memes, i.e.^ '*The productivity was low." "Productivity'' 
represents one oase co-meme: "product," and two morphemic co- 
memes; "ive" ana ''ity' (p.7). 

The point of reference for Endicott's scale is wha: he calls a minimal 
level of language, defined as "a sentence which contains no optional 
transformations an-i v/niclij barring tense" morphemes , represents a one- 
morpheme per-word i^vel" (p. 7). He proposes to consider phrase structure 
rules and obligatory transformations only v;hen necessary to adjust the 
scale for non-standard dialects. 

Application of the proposed scale for syntactic complexity is illustrated 
by examples. The sentences The clou d was ominous and The cloud was black 
are given as examples of the minimal level of language, i.e#, one morpheme 
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per woru Ct::idiv:ott doas woz oxylain why he regards ominous as a single 

morpheme) ♦ At the rtiniinai iev^tii, there is only one co-meme per word and 

the complexity ratio of each or these sentences is 4/4* The black cloud 

was cm 1 no US naii a jv^ir.piexity ra-'iio of 9/5, since it is regarded as a 

combination of the base co-rriemes of tr^e two constituent sentences plus 

tno co-t;ior!;es involveji i:; the ccn-.b.'ning transformation^ Specifically, 

the sentence is rer.nided as having five base co-memes, three compression 

co-memes (one each for the , cloud, a-nd was which are '^superimposed*'), 

and one syntaciic cc-meme. The higner complexity ratio of the latter 

sentence is regarded as resulting from the application of a syntactic 

resource bey:>nd the minimal level of language. 

Cvvupoundin^i. Jhe^ rn .jn walked £ nii.!:£ ^nd The bo y walked a mile to produce 

The man and the bo y v ;aJ>ked _a rrii le resralts in a complexity ratio of 12/8 • 

The co-meme count includes eight base co-memes, three compression co-memes > 

and one syntactic co-n»eme* The beate n team left t he field is regarded • 

as resulting from the transformational combining of The team left the field 

and Someone beat the team* The complexity ratio of 12/6 is explained by 

Veiorcnct? to tne l)a3o co-:nemes and the several syntactic* co-memes associated 

with the passive, deletion, and adjective transformations involved. The 

genitive in The boy ' s dog is a spaniel is accounted for by combining 

The dog lb a L^panltjl :\i\d Tne bo^ has d uog » The complexity ratio of 11/6 

for the resulting sentence is explained as follows: 

Count one syntactic co-rr.erne, one base co-meme, one compression 
co-meme for tne deletion of ''has'', two compression co**memes for 
the superimposition of ''dog' and '^the'', and one morphemic 
co-meine for th^ possessive inflection* The count is six 
co-memes for a ?;enitive (p*ll)* 
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'io erriphasise tlie usefulness of tne scale as an indicator of level - 
of maturity, Endicott contrasts Tneir imaginative resourcefulness was a 
productive factor v;ita Tne brown dog was a good dog ^ both of which have 
a T-unit ieiigtn cf sevi;:u v/oras. tne former sentence, which obviously ; 
represents a relatively high level of language maturity, has a complexity 
ratio of 'J**/7; the latter sentence nas a complexity ratio of 15/7 • 

Kmlicott rocognisses the possibility that various interpretations of 
a transformational analysis of language niignt be applied in the derivation 
process, tie says the derivation of values is equivalent to exercising the 
predictive function of the theory. Thus, if researchers disagree on 
particular derivuuicns, their differences can be resolved by comparison 
of predictions and researcn results. He also recognizes the necessity 
of isolating a familiarity /actor for each structure, since lack of 
familiarity v/irn words or syntactic operations may block comprehension* 
He suggests the possibility of determining the difficulty factor for 
structures by ui^aiis of clo*:e techniques. 

Another approach tc devising a discriminating linguistic index is 
reported by Golub and Kidder (1974). Their Syntactic Density Score also 
involves T-unit analysis. It reflects measures of T-unit length and sub- 
ordinate clause lenc;th and also takes into account uses of complex verb 
phrase expansions and various kinds of embedded structures* 

'".olub and Ki.1der report that the first step in developing this 
measure of syntactic density was to determine hov; children use specific 
syntactic structures and which ones seem to predict v/hether their writing 



10 



BEST COPY «VAIUBLE 

will be ratea nL;li, mc:diu;n, or low by their teacners. The items included 
in tne i:core were seLc^cto^ ^vcu\ sixty-tnree structures which aad been 
subjected to multivariate analysis* The variables selected were those 
whicii correiitea hiH,hly v/ith teachers ^ ratings of written language samples* 
Caiionioai correiatioii anaiyois was performed to assign relative weights 
to variables acccrain^ to their contribution to syntactic density. 

Tiio t?vnl3ot]v Donsitv Scjve incorporates ten items , including words 
per T-unit, subordinate clauses per T-unit, words per main clause, words 
per -juDoraiiuto cia-.;.ij, na:r.bi»r o: modals, number of b£ and have forms, 
number of pr*epui>itionai phrases, number of possessives, number of adverbs 
of time, ana number of gerunds, participles and unbound modifiers. Each 
item. is assij-ned icadin;% ranging fj^'om ,20 tor words per main clause to 
.95 for woras per T-anit. The loaaing for each item is multiplied by 
frequency, and the resulting proaucts are summed and divided by the total 
number of T-units. The 'quotient is tne Syntactic Density Score, which 
can DG cor.vcrteca tc a rrade level equivalent. A score of .5 is equivalent 
to ryade level 1, ana a score of 10.9 is equivalent to grade level 14. 
Ldca grade level airfers fron-. th« one 'below it by a Syntactic Density ^ 
Score value of .d. 

Tne researcners inuicate tnat a 500 word language sample can be 
scored In aoout tairty 'r-in^j-t^s, but for greater convenience they have 
written' a computer program, v/hicn they have found to be satisfactorily 
accurate and consistent. They believe the computerized scoring process 
nas a number of vaiudble u^e.st 



-10- 

BEST COPY AVAILABLE 

Tiie co;uK.'Uter prov<raai is useful for classroom teacners who want 
to iiagnose er*tc-rinr behavior levels ana subsequent performance 
L^v\;>is ot tnt^ir :^^udent^ and to nv=^tch these levels with reading 
K;aterUi.*i ,*c,ea in oiaas. examination by a teacher of English 
of tne subscorea of a class of students gives an indication of 
sLrt^ncjtbc. ana weaK:. esses that can guide tne teacher ^s planning 
01 objecciveo and ^^rocedures* The Syntactic Density Score of 
tne variety of novels, textbooks and other reading materials 
used in specially designed nfjini-courses or performance-based 
curricaiun.s oaii re deteniiineci quickly, accurately , and inexpen- 
sively (p. 1131) . 

cjjlu:; and KidJer reccv;nii:e that their measure quantifies only the 
syntactic aspect of language, and they report that they are attempti.a to 
prepare a voca^u.a^y intensity inaex which will make it possible to present 
a more precise picture of lan^^uage aevelopment. 

Tne indices of language development devised by both Endicott and 

uoluu appear to havv- rreater capacity to discriminate among various kinds 

of syntactic constructions than io indices such as mean length of T-unitS| 

« 

mean lengtn of clauses, and ffiean number of subordinate clauses per T-unit* 
Upon close examinatinn, hov/ever, a critical observer can hardly avoid 
concluding that zhe quest for an efficacious index of language development 
is not yet ended. 

Enaicott's proposoa scale of syntactic complexity seems to be based 
on an early stae,e oi transformational theory, and he apparently believes 
in tne psycholinsuis ^Ic reality of transformational processes. Perhaps 
Tirj bt>aton t^^am is litore complex than The brown dog , and perhaps the 
difference can be expiainea by differences in the number of transformational 
processes involved in the aerivation of the sentences. But it seems quite 
unlikely that The beaten team is any more complex conceptually than The 
new team, a structure which would get the same complexity ratio as The 
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b rown dog.. Likewise, it U narU to sue how His dog^ is any more coinplex than 
i'he dog , but L'iio latter :5truoture :ats much less weight on the syntactic 
scala. rrouabiy noboay argu'2 that Their imaginative resourcefulness • 

was a proauctivo factor Is not a complex structure, but it is not at all 
Ov>rt.\ln that ttio co:np.Loxity coti be accounted for hy tlie number of morphemes, 
tven if nurib'sr of mcrpn-itnes is a valid indicator, analysts are likely to 
©ncvVuHter occasional difficulty In u^ecidlne what to count as a separate 
morpheme, for examples way shouldn't ominous be counted as omen + ous? 
.'md how many morphemes are tnere in aavertised detergents? 

Thus, while it i-2 ocvious tnat Lndicott's proposed scale has the 
capacity to discriminate one type of structure from another, it is not 
so obvious that it would always discriminate in the right direction. The 
proposed scale probably nierii3 cne kind of testing by researchers that Endicott 
calls for, ana pernaps it will prove to be self-correcting. It may be that 
the famiiidrity factor ca:i i;c- dealt with in a sat '.sfactory manner, but it 
appearri tnat a ^c^tat deul of work rexains yet to be done. 

Golub's Syntactic Density Scale also has considerable capacity to 
aijoritiiinatG ' ypos ot strucLures, and it appears to have been developed 
by empirical procedures. The items included, however, appear to be highly 
reaundant. For ei-,3'r.pl^^ , words per T-unit results from the combined effects 
oi number- of ciausej ard l<snGtn -A clauses, and lengtn cii'.es results 
in part from the remaining items on the scale. Althoug.. i.-if . tatistical 
procedures may be impeccaLie, it appears that some arbitra.-'y decisions were 
made in ajsi»;ning weights, for example, it is difficult to Sfee why participles 
should have a loading of .85 when prepositional phrases are valued at only .75. 
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Incloed, it is to zq^ w.iy ail prepositional phras<s;s should be 

assigned tne samo wei.:»i)t sinco they differ widely in tne difficulty level of 
concepts they represent. 

Although Golub'a r:,,2asure has more statistical evidence of validity 
tnan Endicctc'a, it oeems likely tnat the Syntactic Density Score is less 
precise than it appears to oe, particularly when it is converted to grade 
Xtn'el oquivdlenoy. tiven tiie diversity of individual writing styles, the 
diverse demands imposed on language by different situations, and other factors 
yet unknown, it is hard to see how grade equivalence based on a Syntactic 
Density Score could an used with a high degree of confidence. 

Since neither Golub nor Endicott specify how large a sample of writing 
is needed and how many rr.odes of discourse it should represent, it may be 
thcit these specifioatioi'.s We not yet been worked out. If so, O'Donnell's 
comment on the limitations of earlier raeasures is still applicable: "there 
are no data to show how consistently these indexes measure tne structural 
complexity of an individual's writing in various situations." 

In spite of thg limitations cited above, however, there is little 
uoubt tnat if tiio languas^e sample is large enough and diverse enough both 
Golub 's and Endicott's measures will reveal developmental differences in 
syntactic complexity of language. Since both of them involve rather complex 
and expensive procedures of analysis, however, it seems appropriate to raise 
the question of how much better they are than the grosser measures of T-unit 
length, clause length, and number of subordinate clauses. For if the language 
sample is large enoutjh and diverse enough, there is little doubt' that these 
less precise measures, will also reveal developmental differences in language 
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compieKity, It may turr; ou: x:.2Z in spite of lack of precision T-unit 
ioagth is sxllx tne :r.osu useful and useable index of syntactic devaXopment 
over a wiUo ar.e-ranA^i ana that :rean clause length is the best single measure 
of syntactic. complexity at the high scnool level and beyond. Tne research 
required to find out how the Vdricas indices conpare with one another ought 
to be done, 
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